geneseqn2002a: * 
geneseqn2002b: * 
geneseqn2003a: * 
geneseqn2003b: * 
AEA00728 

ID AEA00728 standard; DNA; 756 BP.
XX
AC AEA00728;
DT 28-JUL-2005 (first entry)
XX
DE Brevibacillus choshinensis DNA #1.
XX
KW Cell culture; Brevibacillus choshinensis; gene; ds.
XX
OS Brevibacillus choshinensis.
XX
FH Key Location/Qualifiers
FT CDS 1..756
FT /*tag= a
FT /product= "B. choshinensis protein #1"
XX
PN WO2005045005-A1.
PD 19-MAY-2005.
PF 08-NOV-2004; 2004WO-JP016912.
PR 11-NOV-2003; 2003JP-00381606.
PA (HGET ) HIGETA SHOYU KK.
XX
PI Hanagata H, Nishijyo T;
DR WPI; 2005-366840/37.
DR P-PSDB; AEA00729.
XX
PT New Brevibacillus choshinensis, that does not form spores and which shows
PT low extracellular or intracellular protease activity, useful as host for
PT producing recombinant protein.
PS Claim 4; SEQ ID NO 1; 103pp; Japanese.
XX 

CC The invention relates to a Brevibacillus choshinensis HPD31-SP3 (FERM BP-
CC 08479), which does not form spores and which has mycological
CC characteristics such as cell size and rod shape and physiological
CC characteristics such as negative for nitrate reduction and positive for
CC citric acid utilization, oxidase and catalase, and showing low
CC extracellular protease activity. The invention also relates to a
CC transformed B. choshinensis using a vector containing the gene encoding
CC the protein of the invention. B. choshinensis is useful as a host for
CC producing a recombinant protein and for producing a protein by culturing
CC a transformed host. It has decreased extracellular protein degradation
CC activity when compared with other strains. This sequence represents B.
CC choshinensis DNA of the invention.
SQ Sequence 756 BP; 207 A; 166 C; 211 G; 172 T; 0 U; 0 Other;
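
As a quick consistency check on a composition line like the one above, the per-base counts should add up to the stated length (here read as 207 A + 166 C + 211 G + 172 T = 756). The following is a minimal Python sketch, assuming the cleaned-up field layout "SQ Sequence N BP; n A; n C; ...". The function name `check_sq_line` is hypothetical and not part of any GeneSeq tooling.

```python
import re

def check_sq_line(line: str) -> bool:
    """Return True when the base counts on an SQ line sum to the stated length."""
    # Stated length, e.g. "Sequence 756 BP"
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    # Count/label pairs that follow a semicolon, e.g. "; 207 A" or "; 0 Other"
    counts = re.findall(r";\s*(\d+)\s+([A-Za-z]+)", line)
    return sum(int(n) for n, _ in counts) == length

sq = "SQ Sequence 756 BP; 207 A; 166 C; 211 G; 172 T; 0 U; 0 Other;"
print(check_sq_line(sq))
```

A failed check on an OCR-derived record usually points to a misread digit or base letter rather than a problem in the underlying database entry.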

Query Match 100.0% Score 756; DB 18; Length 756;
Best Local Similarity 100.0% Pred. No. 3e-234;
Matches 756; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

0/ 1 ATG^G^"GCCG^TATCAAAAATGGGAG^CAACCATTTCTG^CCAATGACCAAG^GAAAG^T 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATG^G^"GCCG^TATCAAAAATGGGAG^CAACCATTTCTG^CCAATG^CCAAG^GAAAG^T 60 

Cy 61 TTG^TAGCCAAG^GGCAAGCTG^GG^TACGG^TQCACG^GAGCTTC^CG^GAATAGCAAT 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 TTG^TAGCCAAG^GGCAAGC^Q33G^^TACGG^TGCACG^GAGCTTC^CG^GAATAGCAAT 120 

cy 121 ATCAGAC^G3^C^G3^CCG^CG^CCAQCGCTTTATCAACCGGGGG^"ATG^GGGG^TG^T 180 

,2, kvimmiiimiiimmkviM!mmyiii.v^^ ,8o 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



301 CQCG^TG^0QG^■ACQG^TMQG^(^G^0G^TCG^^AAMGAMCAQCX3AATMGG^GI0GG 360 

360 

361 0GATCAAAQ3^TGAATTG^■A(^QGAATTCQ^0C!G^■QCmXA0GATCQGAGftACT 420 

I I I I 

■QQGA 420 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
^^^TCAAAG3^TGAATTG^■ACAAQCAATTCG^0C)G^^Q0C)C^}A0GATCGGA(^G^Q(— ' 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GAAQ3AGrQ33AATGACQCCQG^Q»\AGrAGrCn"TTQCQ3AAGAQQGAAQ3AG^QGQCCT 480 

TCCTCGATOGATGAG^OCG^^TTTG^AAATG^OQ^CGATCCGATCAGACTGATCG^TGAG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTCGATCGATGAG^(XX?rrTTTGW\ATG^CG3CGATC(XATCAGACTGATCX^ 540 

ATAam\TG^QGrGn-GAAGAAGn-GGnTTG^G'W\ATTQC)CTTGAAQ»\CQGCATCAQC 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATAQCX^3^TG^AQGTGr^GAA(:W^G^"GG^TTG^G^AAATTQCCTTGAAQG^CQ3CA 600 

AQQa^G^QCG^QCG^GAQGAQCTGATCG^CTACC^Q3QG^" ATT ACAAQG^TGAGAGAGAG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGQGTG^G0G^QCG^GAGGAGa^GATCG^CTA0CTQ3GGTATTAGAAGG^TGAGAGAGAG 660 

661 TCTG^Q3^AGCAGAGCGTC^AG^3ATTTCGCAGG^CCAGGTCTCGCGTGTGG^AAAQCG^ 720 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGTG^QGTAGGAGAGCX3TCTAG33ATTTa3CAGGra:AGGn-CTGGCX?rCrQ»WV\G0Gr 720 

721 ATCCTQGTAACGflkTGAAGGftGCAAATTG^GATTAG 756 

RESULT 2
ABK78469

ID ABK78469 standard; DNA; 665 BP.
AC ABK78469;
DT 13-AUG-2002 (first entry)
XX
DE Bacillus clausii genomic sequence tag (GST) #1312.
XX
KW Differential gene expression; genomic sequenced tag; GST;
KW altered culture condition; environmental stress;
KW physiological provocation; ds.
XX
OS Bacillus clausii.
PN WO200229113-A2.
PD 11-APR-2002.
PF 05-OCT-2001; 2001WO-US031437.
PR 06-OCT-2000; 2000US-00680598.
PR 27-MAR-2001; 2001US-0279526P.
PA (NOVO ) NOVOZYMES BIOTECH INC.
PA (NOVO ) NOVOZYMES AS.
PI Berka R, Clausen IG;
DR WPI; 2002-416684/44.
XX
PT Monitoring differential expression of several genes in first Bacillus
PT cell relative to expression of same genes in one or more second Bacillus
PT cells, by using substrate containing Bacillus genomic sequenced tag
PT array.
PS Claim 11; SEQ ID 5760; 200pp; English.
XX 

CC The invention describes a method of monitoring differential expression of
CC genes in a first Bacillus cell relative to expression of the genes in
CC other Bacillus cells, comprising hybridising labelled nucleic acid probes
CC isolated from Bacillus cells to a substrate containing array of Bacillus
CC genomic sequenced tags (GST), examining the array, and determining
CC relative gene expression by an observed hybridisation reporter signal of
CC a spot in the array. The method is useful for measuring the expression of
CC genes in a first Bacillus cell relative to expression of the same genes
CC in one or more second Bacillus cells. The method is useful for monitoring
CC global expression of several genes from a Bacillus cell, discovering new
CC genes, identifying possible functions of unknown open reading frames and
CC monitoring gene copy number variation and stability. Monitoring changes
CC in expression of genes may be used to provide a representation of the way
CC in which Bacillus cells adapt to changes in culture conditions,
CC environmental stress or other physiological provocation. Extensive follow
CC -up characterisation is unnecessary, when one spot on an array equals one
CC gene or one open reading frame, since sequence information is available.
CC This sequence represents a genomic sequence tag (GST) used in the method
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences
SQ Sequence 665 BP; 210 A; 134 C; 159 G; 161 T; 0 U; 1 Other;

Query Match 36.2% Score 273.4; DB 6; Length 665;
Best Local Similarity 67.4% Pred. No. 1.9e-77;
Matches 399; Conservative 0; Mismatches 192; Indels 1; Gaps 1;
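
The summary figures for each hit appear to be related in a simple way. Across the hits in this listing, Best Local Similarity agrees with Matches / (Matches + Mismatches + Indels), and Query Match agrees with Score divided by 756, the score of the 100% hit against the query itself. This is an observation from the numbers shown here, not a documented definition of the search program. A Python sketch with hypothetical function names:

```python
def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent identity over the aligned columns (assumed definition)."""
    return 100.0 * matches / (matches + mismatches + indels)

def query_match(score: float, self_score: float = 756.0) -> float:
    """Hit score as a percentage of the query's self-score (assumed definition)."""
    return 100.0 * score / self_score

# Values from the Bacillus clausii hit (ABK78469)
print(round(best_local_similarity(399, 192, 1), 1))  # 67.4
print(round(query_match(273.4), 1))                  # 36.2
```

Recomputing these percentages is a cheap way to catch misread digits in the Matches, Mismatches, Indels, or Score fields of an OCR-derived listing.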

GAATG^CCAAGrGAAAG^TTTG^TAGCCAAG^GCC/V\QCTGXG^TACQG^TGCA 96 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
«3^TAAACAAGrGAAAG^QCrrATTGCAAAAAGCCAG3AAGXG^CACAG<V\GCA 99 

^GCTTCTCGTG^ATAGCAATATCAG'XCTGGrCTGGrCCGrCGrCCAQCGCTTTATC 1 56 

I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 
Qy/Db row start positions: Db 100; Qy 157; Db 160; Qy 217; Db 220; Qy 277; Db 280; Qy 337; Db 340; Qy 397; Db 400; Qy 457

AAGTCT 219 



I I I I II II I I I I I I I I I I I I I I I I I I II II I I I I I I II II II II I 



"QCGGGA>TG^CGG^ACGG^T/V\QG^CAG^CG^TCG^TA 336 
"CCATT 339 



I I I I I I I I I I I I I I I I I I I I I I I I I I 
■CTQCG^3^TG^TG^GACAG^G^AAG^/V\QCCQG^( — 



1 1 1 1 II I 1 1 1 1 1 III I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 

G^CCATTAATGAGATCGGT»\ACATTTAQ30GrG^CG3CrG^Q»\AATTGrATTT 459 
457 Q3GCAAG^G3CAAGCAG^G3Q0CTTGCT0GATGGATGAG^00GTTTTTG^AAATG^CGX 516 
Qy/Db row start positions: Db 460; Qy 517; Db 520; Qy 577; Db 580

II 1 1 1 1 1 I I I I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

460 Q3TQ3AG^TQ3GAACCQ3AQCTTG^"CCTCAATCGATGAMCQG^"TTATG'WV\TG^a 519 



I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I II 



RESULT 3 
ADN60500 

ID ADN60500 standard; DNA; 1438 BP.
AC ADN60500;
DT 01-JUL-2004 (first entry)
XX
DE B. licheniformis sporulation related polynucleotide, seq id 172.
KW Mutant host cell; sporulation; oxidoreductase; transferase; hydrolase;
KW lyase; isomerase; ligase; gene; ds.
XX
OS Bacillus licheniformis.
PN WO2003087148-A2.
PD 23-OCT-2003.
PF 25-MAR-2003; 2003WO-DK000200.
PR 10-APR-2002; 2002DK-00000533.
PA (NOVO ) NOVOZYMES AS.
XX
PI Andersen JT, Jorgensen ST, Rasmussen MD, Olsen PB, Clausen IG;
XX
DR WPI; 2004-122131/12.
DR P-PSDB; ADN60501.
XX
PT A Bacillus licheniformis mutant host cell for producing a product of
PT interest e.g. vitamins, antibiotics and enzymes.
PS Claim 1; SEQ ID NO 172; 319pp; English.
XX 

CC The invention relates to a Bacillus licheniformis mutant host cell
CC derived from a parent B. licheniformis host cell. The mutant host cell is
CC mutated in one or more genes encoding one or more polypeptides involved
CC in sporulation. The host cell comprises one or more heterologous genes
CC present in at least two copies, encoding one or more heterologous
CC polypeptides. The heterologous genes are stably integrated into the
CC genome of the cell without leaving any antibiotic resistance marker genes
CC at the site of integration. The heterologous genes are transcribed from a
CC heterologous promoter or from an artificial promoter, and are comprised
CC in an operon, preferably a polycistronic operon. The heterologous
CC polypeptide is an antimicrobial peptide, or a fusion peptide comprising a
CC peptide part which in its native form has antimicrobial activity. The
CC heterologous polypeptide is an enzyme, preferably a secreted enzyme. The
CC enzyme is an enzyme of a class selected from the group of enzyme classes
CC consisting of oxidoreductases (EC 1), transferases (EC 2), hydrolases (EC
CC 3), lyases (EC 4), isomerases (EC 5), and ligases (EC 6). The Bacillus
CC licheniformis is useful in a process for producing at least one product
CC of interest, comprising cultivating a B. licheniformis mutant host cell
CC in a suitable medium, whereby the said product is produced. The process
CC further comprises isolating or purifying the product of interest. The
CC current sequence represents a B. licheniformis sporulation related
CC polynucleotide.
SQ Sequence 1438 BP; 421 A; 324 C; 355 G; 338 T; 0 U; 0 Other;

Query Match 25.4% Score 192.2; DB 12; Length 1438;
Best Local Similarity 63.7% Pred. No. 7.1e-51;
Matches 309; Conservative 0; Mismatches 173; Indels 3; Gaps 1;

^"Q3CG^TATGAAAAATQGGAG^CAACCATTTCTG^CGAATG^CC- - - AAGTGAAA 57 

II II I I I I I I I I II III I I I I I I I I I I I I I I 



II III! II Mill mill I III I II II III I II 

G^ACTG^TTAAAAACAQ3CAGAAam3^TGAAAAAQCMQ^3A(XT(^ 1 073 

118 AATATCAGACTGGn-CTGGn-aX5rCX?r(XAQ0GCTTTATCAA0CXr^ 177 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I III 
AAGATQGG^"CTTG^"TTQG^"CTG^CG^TGAQCGTTTTTTG^AGAG^Q^3TATG^QCCTG^C 1 133 

GATTTGTTTGAGATCQGTTQCATTQXTTQCTGAAQXCGTTGACAAGTTCG^TCTTTCXB 237 

II I I I I I I I I I I I I I I I I III I I III I II I I I I I I I I I I I I I I I I 
GACCTCTTTGAAATCQQCTQCATCa»JrCTTG^GrCQ3TQ3ACAAATTC«^TCTTT^ 1193 

TACG^TG^G^GATTTTCGACCTATQ3QG^QCGAATG^TCATCGG^G^AATTGAACG3TTT 297 
II II II I I I I I I I I I I I II II II I I I I I I I I I I II II I I I II III 
TATG^OG^TOQG^TTTOGAOCTACG^CG^^(XX3^TG^TTATOGQ3G^GATTGAQCG^ 1253 

TTQ3QCG^TG^C!QG^AC!QG^^AAQG^(^G^C!GAT(:X?^TAAAAGAAACAGC^ 357 

AV<>I^VIilMyi-<ikAli-<m:i-M<mka-c^^ 13,3 



Qy/Db row start positions: Qy 118; Qy 238; Db 1194; Qy 298; Db 1254; Qy 358; Db 1314; Qy 418; Db 1374; Qy 478; Db 1434

5^TT!ici^(xd^TGAQliAATC 1 



373 

QGAG^AQGAG^Q^G^AT(^C!QCm3^QG^AG^AG^CTTTQC!QGAAG^QGGAAQGAG^Q3G 477 

II II II Mill I II II II II I II II II II III 



lUtl4 



RESULT 4 
AEH93992 

ID AEH93992 standard; cDNA; 777 BP. 
XX 

AC AEH93992; 
XX 

DT 27-JUL-2006 (first entry)
XX
DE SigG coding sequence.
XX
KW ss; gene; protein production; sigma factor; RNA polymerase;
KW alkaline protease; food.
XX
OS Bacillus sp.; KSM-9865.
XX
FH Key Location/Qualifiers
FT CDS 1..777
FT /*tag= a
FT /product= "SigG"
PN JP2006136221-A.
PD 01-JUN-2006.
PF 10-NOV-2004; 2004JP-00326973.
PR 10-NOV-2004; 2004JP-00326973.
PA (KAOS ) KAO CORP.
XX
PI Sumitomo N, Okuda T, Takimura Y, Sato T, Kobayashi T;
DR WPI; 2006-385028/40.
DR P-PSDB; AEH93994.
XX
PT Novel sporulation related gene encoding SigE protein or SigG protein
PT having alkaline protease activity, useful in foodstuffs such as alcoholic
PT beverage, bean paste, soy sauce, pharmaceuticals and cosmetics.
PS Claim 1; SEQ ID NO 2; 15pp; Japanese.
XX 

CC This sequence represents the SigG sequence which encodes the sigma factor
CC which is a subunit of RNA polymerase. A microorganism transformed with
CC the SigE or SigG sequence is useful for producing a protein or
CC polypeptide having alkaline protease activity. The sigma peptides are
CC useful in foodstuffs e.g. alcoholic beverage, bean paste, soy sauce,
CC pharmaceuticals and cosmetics. The protein having alkaline protease
CC activity can be produced efficiently using non-sporulated microorganisms.
SQ Sequence 777 BP; 264 A; 123 C; 189 G; 201 T; 0 U; 0 Other;

Query Match 23.6% Score 178.2; DB 21; Length 777;
Best Local Similarity 53.9% Pred. No. 1.8e-46;
Matches 391; Conservative 0; Mismatches 328; Indels 6; Gaps 1;



cy 31 0CATTTCTQ\CGAATGACGAAGrGAAAG^TTTGfliTAGCGAAG^QXAAQ3TQX»\TACG 90 

II I I I I I I I I I I I I I I III I II I I I I I I I I 

Db 49 CCTGrTTTG^G^ATGAAG^AATQCGG^GrTATTTCGrGAAATGCAAGGCGG^G^AOTT 108 

cy 91 G^TGCACG^G^GCTTCTCG^GAATAGCAATATGAG^CTGG^C^GG^CCG^CG^CCAGCGC 150 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 109 TCTGCAAGAG^AAAGGTCGrGAATQGG^ACGAAAGQCTCCTTCTAAGrGTTATTCAACQG 168 

cy 151 TTTATCAACCQCQGG^ATQV\GCQG^TG^TTTG^"TGAG^TCGG^TQCATTGQCTTGC^C 210 

cy 211 AAQ^CCG^TG^CAAG^TCG^TCTTTCG^ACG^TG^G^G^TTTTCGACC^ATQCQG^QOCA 270 

cy 271 ATG^TGATCG3AGAAATTGAACGC^TTTQ0Q0G^TGACQG^ACQG^■AAQG^CAG^CGA 330 

Db 289 kVliVAkVyidiivciVAAQMAUAy^^ 348 

cy 331 TCGrTAAAAG^AAGAQOGAATAAQGrQOGGOGATCAAAQGATGAATTGTACAAQGAATTC 390 

Db 349 TCAWQCGn"GATATTQl:CTACAAQl^^ 408 

cy 391 Q3a:GrQCCa3GACG^TCGGAG^GrQGCAG^GCAGrG3GV\TCACGCCGG<\GG<V\GrA 450 

Db 409 TCAAQQG^Gy3TA!xQGAG^GJv\WTC^^ 468 

Cy 451 G^CTTTQCQGAAGAG^GAAGGAG^GGGCCTTCCTCGATCCATGAGACCG^TTTTG'WV\T 510 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 469 G^TTTTQCTTTAAATGCTATTGAGG^TCCCG^G^CGTTATTTGAGCCTATCTATAATG^T 528 

cy 511 G^CQ3CG^TCCGATGAGACTGATCG^TGAG^TAQGQ3^TGAAGG^G^GAA- GAAG 564 

I I I I I I I I I I I I I II I I I I II I I I II II 

Db 529 QG^Q3AG^TCCG^TTTATG^ATTQG^TCAAATGAG^GACGAAAAGAACAAAG^TATCGAA 588 

cy 565 TGCTTTG^GA^AATTGCCTTGAAQ3^CGXATGAGGAQXTG^QCG^GCCTG^GGAGCTC 624 

I I I I I I I I I I I II II II I I I I I I II II III I I I I I 
Db 589 TGG^TAG^AG^G^TAGCACTAAAAG^AQG^ATG^GACG^CTCAATG^GAQ^G^AAAGCTC 648 
Qy/Db row start positions: Qy 625; Db 649; Qy 685; Db 709; Qy 745; Db 769

II I II II I II II 



■CA 773 

RESULT 5
ABK74144

ID ABK74144 standard; DNA; 381 BP.
AC ABK74144;
DT 13-AUG-2002 (first entry)
XX
DE Bacillus licheniformis genomic sequence tag (GST) #1435.
XX
KW Differential gene expression; genomic sequenced tag; GST;
KW altered culture condition; environmental stress;
KW physiological provocation; ds.
XX
OS Bacillus licheniformis.
PN WO200229113-A2.
PD 11-APR-2002.
PF 05-OCT-2001; 2001WO-US031437.
PR 06-OCT-2000; 2000US-00680598.
PR 27-MAR-2001; 2001US-0279526P.
PA (NOVO ) NOVOZYMES BIOTECH INC.
PA (NOVO ) NOVOZYMES AS.
XX
PI Berka R, Clausen IG;
DR WPI; 2002-416684/44.
XX
PT Monitoring differential expression of several genes in first Bacillus
PT cell relative to expression of same genes in one or more second Bacillus
PT cells, by using substrate containing Bacillus genomic sequenced tag
PT array.
XX
PS Claim 4; SEQ ID NO 1435; 200pp; English.
XX 

CC The invention describes a method of monitoring differential expression of
CC genes in a first Bacillus cell relative to expression of the genes in
CC other Bacillus cells, comprising hybridising labelled nucleic acid probes
CC isolated from Bacillus cells to a substrate containing array of Bacillus
CC genomic sequenced tags (GST), examining the array, and determining
CC relative gene expression by an observed hybridisation reporter signal of
CC a spot in the array. The method is useful for measuring the expression of
CC genes in a first Bacillus cell relative to expression of the same genes
CC in one or more second Bacillus cells. The method is useful for monitoring
CC global expression of several genes from a Bacillus cell, discovering new
CC genes, identifying possible functions of unknown open reading frames and
CC monitoring gene copy number variation and stability. Monitoring changes
CC in expression of genes may be used to provide a representation of the way
CC in which Bacillus cells adapt to changes in culture conditions,
CC environmental stress or other physiological provocation. Extensive follow
CC -up characterisation is unnecessary, when one spot on an array equals one
CC gene or one open reading frame, since sequence information is available.
CC This sequence represents a genomic sequence tag (GST) used in the method
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences
SQ Sequence 381 BP; 109 A; 91 C; 94 G; 87 T; 0 U; 0 Other;

Query Match 23.0% Score 173.8; DB 6; Length 381;
Best Local Similarity 66.9% Pred. No. 3.4e-45;
Matches 247; Conservative 0; Mismatches 122; Indels 0; Gaps 0;
AAGrGAAAG^TTTG^TAQOCAAG^GCGAAQCTG^CG^TACG:^TQCACG^^GAGCTTCTCG 1 09 

TGAATAQGAATATGAGACTQGn-CTQSrCOGn-CGrCGAQOQCTTTATCMCSCQCXX^ 1 69 

I II II II I II II Mill Mill Mill III I III I II MM 



TTC!G^"ACG^TG^G^G^TTTTC!G^OCTATQC!QG^QC!CAATG^TCATC!QG^GAAATTC 289 

3CTTTTTGCGCG^TGACGG^ACQGrTAAQG^CAGTCGATCG^TAAAAGAAAGAQCGA 349 

MM I M M M M M M M M M M M I I M M M I I 



Qy/Db row start positions: Qy 50; Db 13; Qy 110; Db 73; Qy 170; Db 133; Qy 230; Db 193; Qy 290; Db 253; Qy 350; Db 313; Qy 410; Db 373



MM I I I II I I II II M 



RESULT 6 
ABK74048 

ID ABK74048 standard; DNA; 631 BP.
AC ABK74048;
DT 13-AUG-2002 (first entry)
XX
DE Bacillus licheniformis genomic sequence tag (GST) #1339.
XX
KW Differential gene expression; genomic sequenced tag; GST;
KW altered culture condition; environmental stress;
KW physiological provocation; ds.
XX
OS Bacillus licheniformis.
XX
PN WO200229113-A2.
PD 11-APR-2002.
PF 05-OCT-2001; 2001WO-US031437.
PR 06-OCT-2000; 2000US-00680598.
PR 27-MAR-2001; 2001US-0279526P.
PA (NOVO ) NOVOZYMES BIOTECH INC.
PA (NOVO ) NOVOZYMES AS.
PI Berka R, Clausen IG;
DR WPI; 2002-416684/44.
XX
PT Monitoring differential expression of several genes in first Bacillus
PT cell relative to expression of same genes in one or more second Bacillus
PT cells, by using substrate containing Bacillus genomic sequenced tag
PT array.
PS Claim 4; SEQ ID NO 1339; 200pp; English.
XX 

CC The invention describes a method of monitoring differential expression of
CC genes in a first Bacillus cell relative to expression of the genes in
CC other Bacillus cells, comprising hybridising labelled nucleic acid probes
CC isolated from Bacillus cells to a substrate containing array of Bacillus
CC genomic sequenced tags (GST), examining the array, and determining
CC relative gene expression by an observed hybridisation reporter signal of
CC a spot in the array. The method is useful for measuring the expression of
CC genes in a first Bacillus cell relative to expression of the same genes
CC in one or more second Bacillus cells. The method is useful for monitoring
CC global expression of several genes from a Bacillus cell, discovering new
CC genes, identifying possible functions of unknown open reading frames and
CC monitoring gene copy number variation and stability. Monitoring changes
CC in expression of genes may be used to provide a representation of the way
CC in which Bacillus cells adapt to changes in culture conditions,
CC environmental stress or other physiological provocation. Extensive follow
CC -up characterisation is unnecessary, when one spot on an array equals one
CC gene or one open reading frame, since sequence information is available.
CC This sequence represents a genomic sequence tag (GST) used in the method
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences
SQ Sequence 631 BP; 203 A; 121 C; 161 G; 146 T; 0 U; 0 Other;

Query Match 19.4% Score 147; DB 6; Length 631;
Best Local Similarity 56.8% Pred. No. 2.3e-36;
Matches 270; Conservative 0; Mismatches 205; Indels 0; Gaps 0;



CV 77 MGC^Q^CG^TACQ3^TQCACG^GAQC^TC^CG^GAATAQCAATATCAG^C^QG^"C^QG^" 136 

Db 98 ATls^l^UlicAl^AACAlixBftUyWVVQy^ 157 

0/ 137 00Gr0Gr00AQCQOrTTAT0AA00QGGQGrAT»V\GCG3^TG^TTTGrTT0AG^T0QGrT 196 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 158 G3GrOATTOAAAQGrTT/V\OAAOA»\GG^GAATATGrTG^TG^CTTATTOOAAGrOGG3T 217 

0/ 197 GCATTGGCTTQOrOAAGGCOGrTGAOAAGTTOG^TCTTTOGrAOGATGrG^GaTTTTOGA 256 

I I I I I I I III I I I II II I II II I I I I I I I I I I I I I I 

Db 218 QCATOGGOiCTAATG^TGAATTGATAATTTTG^CCTG^GCGAGAATGrTAAGrTTTOAA 277 

CV 257 OCTATQCQGTQOCAATG^TCATCQC^GAAATTCAACQOI I I I I GCXSCXBATGACQGTACQG 316 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
Db 278 OATATGOrGrA0OAATG^T0AT0GGft»W\TO0QGAGATATTTQ3QGGATAAOAA000GA 337 

cy 317 TTAAGGrOAGrOGATOGrTAAAAGAAAGAGGGAATAAGGrG3QQC»\TOAAAG3ATGAAT 376 

I III I I I I II III II I I I I I I I I I I I I II 
Db 338 T00Q3G^0T0A0QG^0A0T0AG^3ATATCQCG^A0AAGXQ0^Q3AAG^G^G^GAA0Q^0 397 

Cy 377 TGrA0AAG0AATT0G300GrQC0000ACG^T0GCA»V\GrGXAGAAG0AGrG3G^T0A 436 

I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 398 TGAT0AG^G^GA0AAG0AG^3AGC0G^CTQCT0AQG^G^T0Q3T/W\GAG3TTGAAG^G^ 457 

Cy 437 0G00GG^GG^CTAG^0TTTG0Q3AAGAQ^GAAQGAGAGCQG0TT00T00AT00ATGAGA 496 

Db 458 (XX^Tl^(iAATC(^TTTT(3t^^ 517 

Cy 497 CCG^TTTTG^AAATQ^CQ^3GATCCCATCACAC^GATC»^TCAG^TAQ0G3ATGA 551 

I III I I I II Mill II I II Mill II Mill 
Db 518 0GATTTACAATG^C)Q3AQGftGAT(X)G^TTTATGrCATG3^TGAAATCAQDGATGA 572 



RESULT 7 
ADH97013 

ID ADH97013 standard; DNA; 1110 BP.
XX
AC ADH97013;
XX
DT 06-MAY-2004 (first entry)
XX
DE S. pneumoniae RNA polymerase sigma-70 factor gene #2.
XX
KW antibacterial; antiinflammatory; gastrointestinal; antiulcer;
KW antidiarrhoeic; ophthalmological; enzyme inhibitor; antisense th
KW vaccine; microbial target; modulator; furuncle; pneumonia; gastr
KW peptic ulcer disease; diarrhoea; meningitis; bacteraemia; conjun
KW toxic shock syndrome; gene; ds.
XX
OS Streptococcus pneumoniae.
XX
PN WO2003102190-A2.
XX
PD
XX
PF JUN-2003; 2003WO-CA000786.



PR MAY-2002; 2002US-0384634P.
PR MAY-2002; 2002US-0385157P.
PR JUN-2002; 2002US-0385542P.
PR JUN-2002; 2002US-0385611P.
PR JUN-2002; 2002US-0385747P.
PR JUN-2002; 2002US-0385750P.
PR JUN-2002; 2002US-0385752P.
PR JUN-2002; 2002US-0385773P.
PR JUN-2002; 2002US-0385780P.
PR JUN-2002; 2002US-0385785P.
PR JUN-2002; 2002US-0385797P.
PR JUN-2002; 2002US-0385962P.
PR JUN-2002; 2002US-0386022P.
PR JUN-2002; 2002US-0386024P.
PR JUN-2002; 2002US-0386087P.
PR JUN-2002; 2002US-0386141P.
PR JUN-2002; 2002US-0386350P.
PR JUN-2002; 2002US-0386586P.
PR JUN-2002; 2002US-0386368P.
PR JUN-2002; 2002US-0386369P.
PR JUN-2002; 2002US-0386436P.
PR JUN-2002; 2002US-0386441P.
PR JUN-2002; 2002US-0386528P.
PR JUN-2002; 2002US-0386573P.
PR JUN-2002; 2002US-0386834P.
PR JUL-2002; 2002US-0399839P.
PR JUL-2002; 2002US-0399861P.
PR JUL-2002; 2002US-0399969P.
PR JUL-2002; 2002US-0399970P.
PR JUL-2002; 2002US-0399983P.
PR JUL-2002; 2002US-0399984P.
PR JUL-2002; 2002US-0399985P.
PR AUG-2002; 2002US-0400154P.
PR AUG-2002; 2002US-0400230P.
PR AUG-2002; 2002US-0400268P.
PR AUG-2002; 2002US-0400363P.
PR AUG-2002; 2002US-0400365P.
PR AUG-2002; 2002US-0400374P.
PR AUG-2002; 2002US-0400380P.
PR AUG-2002; 2002US-0400433P.
PR AUG-2002; 2002US-0400434P.
PR AUG-2002; 2002US-0400436P.
PR 01-AUG-2002; 2002US-0400442P.
PR 01-AUG-2002; 2002US-0400463P.

PA (AFFI-) AFFINIUM PHARM INC.
XX
PI Edwards A, Dharamsi A, Vedadi M, Vallee F, Awrey D, Beattie B;
PI Richards D, Domagala M, Mansoury K, Virag C, Buzadzija K;
PI Mcdonald M, Houston S, Arrowsmith C, Ouyang H, Nethery K, Ng I;
PI Kanagarajah D;
DR WPI; 2004-071165/07.
DR P-PSDB; ADH97014.
XX
PT Compositions comprising recombinant polypeptide targets for pathogenic
PT bacteria, useful for designing modulators for preventing or treating a
PT disease or disorder associated with the species of origin for the
PT polypeptide.
PS Claim 23; SEQ ID NO 204; 606pp; English.
XX 

CC The invention relates to novel compositions (I) comprising isolated,
CC recombinant polypeptides, amino acid sequences having at least about 95%
CC identity with these or an amino acid sequence encoded by a polynucleotide
CC that hybridizes under stringent conditions to the complementary strand of
CC the polynucleotide encoding these polypeptides. The compositions and
CC polypeptides are useful as microbial targets for designing modulators for
CC the prevention or treatment of a disease or disorder associated with the
CC species of origin for the polypeptide, e.g. furuncle, pneumonia,
CC gastritis, peptic ulcer disease, diarrhoea, meningitis, bacteraemia,
CC conjunctivitis or toxic shock syndrome. The polypeptides are also useful
CC for diagnosing a patient suffering from a disease or disorder of a
CC pathogenic species, or for monitoring the effectiveness of an anti-
CC pathogenic treatment. This sequence corresponds to one of the DNA
CC sequences of the invention
SQ Sequence 1110 BP; 341 A; 211 C; 267 G; 291 T; 0 U; 0 Other;

Query Match 10.3% Score 78; DB 12; Length 1110;
Best Local Similarity 50.7% Pred. No. 8.1e-14;
Matches 214; Conservative 0; Mismatches 205; Indels 3; Gaps 1;



0/ 31 CCATTTCTG^CCAATGACGAAG^GAAAG^TTTG^TAQOGAAG^QCGAAGC^G^CG^TACG 90 

II I I I I I I I I I I I II II II II III I II II II II III 

Db 328 CCTCTCTTG^CGAATGAAG^GG^GAAAG^GTTGXACTGGCTGrTG^GGTGGrG^TATC 387 

0/ 91 G^TGCACGTG^GCTTCTCGrGAATAGGAATATGAGACTGGTCTGGrCCGrCGrCGAGCGC 150 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 388 G^AGCGAAAGAACGrCTTGGQ»V\CGCAATCrTCGrTTGGrTGrTTCCATTQCCAAACGC 447 

0/ 151 TTTATCAACCGCGQGTATGftAGGGG^TG^TTTGrTTGAG^TCGGTTGCATTGGCTTQCrC 210 

Db 448 TAT(^lx3GTlirl3l3CATGGAGrrCCTT(ic!A 507 

0/ 211 AAGGCCG^TG^GAAG^TCG^TCTTTOG^ACG^TG^G^G^TTTTOGAOCTATQCGG^GOCA 270 

Db 508 kklslixBli-TliUvvli-WlicTATT!^ 567 

0/ 271 ATG^TGATCQSAGAAAT- - - TGAAOQCnTTTQCQOGATGAOGGTAOGGrTAAGGrCAGr 327 

Db 568 TQy^TTCG^GA(icTATCACTa?^QC^AWQCQ3^(^^ 627 

cy 328 CGATCGrTAAAAGAAAGAG0GAATAAGGrG0GQCGATGAAAG3AT»\ATTGrACAAGGAA 387 

Db 628 GrTGAGATQ3rTliAAl]TATCAATAAATTl3GrT(D^ 687 

cy 388 TTCG^aDGTQCOCCGACGATCQCAG^AG^GGCAG^AGGAG^Q^G^ATCAOGOOGG^Q^ 447 

Db 688 Wciiciw^UTy^tACCAlBftACAGAWy^ 747 
Qy 448 GT 449
Db 748 GT 749



RESULT 8
AEC13368

ID AEC13368 standard; DNA; 1110 BP.
XX
AC AEC13368;
XX
DT 20-OCT-2005 (first entry)
XX
DE Streptococcus pneumoniae RNA polymerase sigma-70 factor gene.
XX
KW protein purification; antibacterial; antimicrobial; infection;
KW drug screening; RNA polymerase sigma-70 factor; gene; ss.
XX
OS Streptococcus pneumoniae.
XX
PN US2005181464-A1.
XX
PF SEP-2004; 2004US-00953901.



PR APR-2002; 2002US-0369819P.
PR APR-2002; 2002US-0369826P.
PR APR-2002; 2002US-0369831P.
PR APR-2002; 2002US-0370060P.
PR APR-2002; 2002US-0370681P.
PR APR-2002; 2002US-0370806P.
PR APR-2002; 2002US-0370852P.
PR APR-2002; 2002US-0370868P.
PR APR-2002; 2002US-0370959P.
PR APR-2002; 2002US-0370978P.
PR APR-2002; 2002US-0371008P.
PR APR-2002; 2002US-0371009P.
PR APR-2002; 2002US-0371014P.
PR APR-2002; 2002US-0371025P.
PR APR-2002; 2002US-0371064P.
PR APR-2002; 2002US-0371065P.
PR APR-2002; 2002US-0371094P.
PR APR-2002; 2002US-0371114P.
PR APR-2002; 2002US-0371150P.
PR APR-2002; 2002US-0371189P.
PR MAY-2002; 2002US-0384634P.
PR MAY-2002; 2002US-0385157P.
PR JUN-2002; 2002US-0385542P.
PR JUN-2002; 2002US-0385611P.
PR JUN-2002; 2002US-0385747P.
PR JUN-2002; 2002US-0385750P.
PR JUN-2002; 2002US-0385752P.
PR JUN-2002; 2002US-0385773P.
PR JUN-2002; 2002US-0385780P.
PR JUN-2002; 2002US-0385785P.
PR JUN-2002; 2002US-0385797P.
PR JUN-2002; 2002US-0385962P.
PR JUN-2002; 2002US-0386022P.
PR JUN-2002; 2002US-0386024P.
PR JUN-2002; 2002US-0386087P.
PR JUN-2002; 2002US-0386141P.
PR JUN-2002; 2002US-0386350P.
PR JUN-2002; 2002US-0386586P.
PR JUN-2002; 2002US-0386368P.
PR JUN-2002; 2002US-0386369P.
PR JUN-2002; 2002US-0386436P.
PR JUN-2002; 2002US-0386441P.
PR JUN-2002; 2002US-0386528P.



PR 06-JUN-2002; 2002US-0386573P.
PR 06-JUN-2002; 2002US-0386834P.
PR 31-JUL-2002; 2002US-0399839P.
PR 31-JUL-2002; 2002US-0399861P.
PR 31-JUL-2002; 2002US-0399969P.
PR 31-JUL-2002; 2002US-0399970P.
PR 31-JUL-2002; 2002US-0399983P.
PR 31-JUL-2002; 2002US-0399984P.
PR 31-JUL-2002; 2002US-0399985P.
PR 01-AUG-2002; 2002US-0400154P.
PR 01-AUG-2002; 2002US-0400230P.
PR 01-AUG-2002; 2002US-0400268P.
PR 01-AUG-2002; 2002US-0400363P.
PR 01-AUG-2002; 2002US-0400365P.
PR 01-AUG-2002; 2002US-0400374P.
PR 01-AUG-2002; 2002US-0400380P.
PR 01-AUG-2002; 2002US-0400433P.
PR 01-AUG-2002; 2002US-0400434P.
PR 01-AUG-2002; 2002US-0400436P.
PR 01-AUG-2002; 2002US-0400442P.
PR 01-AUG-2002; 2002US-0400463P.
PR 04-APR-2003; 2003WO-CA000465.
PR 08-APR-2003; 2003WO-CA000482.
PR 08-APR-2003; 2003WO-CA000483.
PR 02-JUN-2003; 2003WO-CA000786.



PA ( AFFI - ) AFFI Nl UM PHARM I NO. 



Edwards A, Dharamsi 
Beat t i e B, Buzadzi j £ 
Kanagar aj ah D, Li Q_ 



Qjyang 



R chards D, 



Vedadi M, 
<, CI ar ke T 
IVbnsoury K, 
Val I ee F, 



Al am Arrowsrrith C, Awrey DE; 

IDormgal a W\ Houst on S; 
Mcdonald M Net her y- Br okx K, Ng I ; 
Vi r ag 0; 



W\ ; 2005-628190/64. 
P-PSDB; AEC13369. 

Novel crystallized, recorrbinant bacterial polypeptide, useful as targets 
for pathogenic bacteria such as Helicobacter pylori, Staphylococcus 
aureus, for detecting pathogenic species in biological sanpl e, and in 
drug desi gni ng. 

a ai m 85; SEQ I D NO 204; 637pp; Engl i sh. 

The i nvent i on r el at es to a conposi t i on ( I ) conpr i si ng pur i f i ed 
pol ypept i des f r om bact er i a. AI so descr i bed: ( 1 ) a cr yst al I i zed, 
r ecorrbi nant pol ypept i de conpr i si ng an ani no aci d sequence of ( I ) , wher e 
t he pol ypept i de i s i n cr yst al f or ni (2) a cr yst al I i zed conpl ex conpr i si ng 
t he cr yst al I i zed, r econbi nant pol ypept i de and a co- f act or or a smal I 
organic nx)l ecul e, where the conpl ex is in crystal formt and (3) a host 
eel I conpr i si ng a nuci ei c aci d encodi ng a pol ypept i de of ( I ) , wher e a 
culture of the host cell produces at least about 1 nig of the polypeptide 
per liter of culture and the polypeptide is at least about one- third 
soluble as measured by gel electrophoresis. (I) can be used as a target 
for pathogenic bacteria, useful for detecting the presence of a 
pat hogeni c sped es in a bi ol ogi cal sarrpi e. ( I ) is usef ul f or nx)ni t or i ng 
the effectiveness of ant i - pat hogeni c t r eat nBnt s in an individual 
suffering from a disease or disorder caused by a pathogenic bacteria, 
such as infections. (I) i s al so useful in drug design and screening, for 
identifying inhibitors of (I), for designing a potential conpound that is 
useful for treating or preventing pathogenic diseases or disorders, for 
assessing the activity of small molecules and other modulators in in 
vitro assay, and for developing antimicrobial agents. The present 
sequence represents a Streptococcus pneumoniae RNA pol ymerase signia-70 
factor gene, which is used in an exanpl e from the present invention. 



SQ Sequence 1110 BP; 341 A; 211 0; 267 Q 291 T; 0 U; 0 Q her 



Query Match 10.3%; Score 78; DB 18; Length 1110;
Best Local Similarity 50.7%; Pred. No. 5.1e-14;
Matches 214; Conservative 0; Mismatches 205; Indels 3; Gaps 1;



0/ 31 CCATTTCTG^aW\TGACGMG^GAMG^TTTG^TAQCGAAG^QCGAAQa^QXG^TACX3 90 

Db 328 CCTCTCTTGAcliAT!^ 387 

Cy 91 G^TQ3ACG^"G^Q3TTCTCX^GAATAa:AATATGAGACTQG^"CTQG^"CCG^CX?^C»\QCQC 150 

I I I I I I I I I I I I MINI I I I I I I I I I I I I I I I 

Db 388 G^AQ3GAAAGAACG^CTTQGQGAACa:AATCTTCGTTTQG^"TG^"TTCCATTQGCAAACQC 447 

CV 151 TTTATGAA(XQCQQ3rATGAAQGQG'\TGA,TTTGrTTCA»\TCQGrTQ3ATTQXTTQCTC 210 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 448 TATGrCQGn"CGn"QQGATQGAGn"TCCTTQ!\CTTG^TTCAAGAAQGfl^TATQQXTTGATG 507 

cy 211 AAQXCG^TG^GAAG^TCG^TCTTTCG^"AaBATG^G^G^TTTTCGACCTATQGQG^QO^ 

I I I I I I I I I I I I I I I I I I III II II I I I I I I I I I I I I 
Db 508 AAQXQ3^TG^GAAG^TTG^CTATTC^AAAQ^G^TGAAG^TTTGAACTTATQGAACTTGG 567 

cy 271 ATGMGATCQ3AGAAAT- - - TGAACQCTTTTTQCQCGATGACQGTACQGrTAAQGrCAGn" 327 

III III II II I I I I I I I I I II 

Db 568 TGGATTCGn"GAQ3CTATCACTCGrQarATTQ3Q3^CCAAQCTCX5rACCATCCGrATCCCA 627 

cy 328 CGATCG^TAAAAGAAAGAGCGAATAAQG^QCQQ3GATCm^Q3ATGfllATTGTACAAQCA^ 387 

Db 628 GrTGAGATQ3rTliw\brAT(Jv\TAAATTl3^ 687 

cy 388 TT0Q30CX?rQCmXA0GATCQGAGftAGrQQGAG^AQGAGn-Q3G^AT(^0QCm^ 447 

Db 688 TTQ(3CX3lw\UT(4xi(DAOGAl3^A(iG^ 747 

Qy 448 GT 449

Db 748 GT 749 



RESULT 9 
ADK44581 

ID ADK44581 standard; DNA; 915 BP.
XX
AC ADK44581;
XX
DT 24-FEB-2005 (first entry)
XX
DE Streptococcus pneumoniae gene, Seq ID No 1096.
XX
KW ds; gene; Antibacterial; Gene therapy; Vaccine; Streptococcus pneumoniae.
XX
OS Streptococcus pneumoniae.
XX
PN US6699703-B1.
XX
PD 02-MAR-2004.
XX
PF 26-MAY-2000; 2000US-00583110.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
XX
PA (GEND) GENOME THERAPEUTICS CORP.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2004-212399/20.
DR P-PSDB; ADK47242.
XX
PT New nucleic acid molecules and polypeptides useful for diagnosing,
PT preventing and treating pathological conditions resulting from bacterial
PT infection, e.g. Streptococcus pneumoniae infection, and in drug
PT screening.
XX
PS Disclosure; SEQ ID NO 1096; 301pp; English.
XX
CC The invention relates to isolated Streptococcus pneumoniae nucleic acids
CC and polypeptides. The nucleic acids and proteins are useful for
CC diagnosing, preventing and treating pathological conditions resulting
CC from bacterial infection, such as S. pneumoniae infection. These may also
CC be used for drug screening procedures. The present sequence represents a
CC Streptococcus pneumoniae nucleic acid of the invention. Note: The
CC sequence data for this patent did not appear in the printed specification
CC but was obtained in electronic format directly from USPTO at
CC seqdata.uspto.gov/sequence.html.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 12; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;



CV 31 OCATTTCTG^OGAATGAOGAAG^GAMG^TTTG^TAQaDAAG^QCGAAQC^Q^CG^TAOG 90 

Db 328 (iTc!^CTTy^(iAATliv^(iQl3^1i^ 387 

CV 91 G^TQCAOG^■G^QCTTCTOG^GAATAQCAATATGAGACTQG^■C^QG^■OCG^OG^OGAQOQC 150 

Db 388 liAlix^WVOAAyryrrlrixivVGC^ 447 

CV 151 TTTATCAACCGCGQSrATGAAQGGGfl^TG^TTTGTTTCAG^TCGGrTQCATTQXTTGCTC 210 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 448 TATGrCQGTCGrGGrATGGAGrTCCrTGflkOTTG^TTCAAGAAQ»V\ATATGQXTTGATG 507 

CV 211 AAQ3CCGrTG^GAAGrTCGMGTTTCGrrACG^TGrG^G^TTTTCGACCTATQCQGrQCCA 270 

I I I I I I I I I I I I I I I I I I III II II I I I I I I I I I I I I 
Db 508 AAQ^CXI?^TG^GAAG^TTG^CTATTCTAAAG^G^"CAAG^TTTCAAC^TATQCAACTTGG 567 

CV 271 AT»\TGATCG3AGAAATTGAAOQCTTTTTQ0Q0G^TGAOG- - - GTACQGTTAAGGTCAGT 327 

III III II II I I I I I I I I I II 

Db 568 TGG^TTCGTCAG^O^ATCAC^CG^(XTATTQ:^GA^0CAAGCTCG^ACCATCCG^ATCCCA 627 

CV 328 0GAT0G^TAAAAGAAACAGCGAATAAQG^GGGQCGATCAAAG3ATGA^TTG^ACAAGGAA 387 

I I I I I I I I I I I I I II III II III 
Db 628 GrTCAGATG3rTGAAA0rATCAATAAATTGGrTCarGAACAGCG3AATCTCCTTCAA»V\ 687 

CV 388 TTCG3CCGTQCCCCCACGATCGCAG^AGrGQCA»V\GGAGrG3»V\TCACGCCGG^G»V\ 447 

Db 688 TTQcix3C>\AUT(ii(^OGA(3^AGA 747 

Qy 448 GT 449

Db 748 GT 749



RESULT 10 
AEJ68509

ID AEJ68509 standard; DNA; 915 BP.
XX
AC AEJ68509;
XX
DT 05-OCT-2006 (first entry)
XX
DE Streptococcus pneumoniae strain 14453 protein-encoding DNA, SEQ 1096.
XX
KW Vaccine; diagnosis; drug discovery; protein production;
KW bacterial infection; Streptococcus pneumoniae infection;
KW bacterial meningitis; bacterial pneumonia; bacteremia; otitis media;
KW antibacterial; neuroprotective; antiinflammatory; respiratory-gen.;
KW auditory; gene; ds.
XX
OS Streptococcus pneumoniae; strain 14453.
XX
FH Key Location/Qualifiers
FT CDS 1..915
FT /*tag= a
FT /product= "Streptococcus pneumoniae protein SEQ ID
FT NO 3757"
XX
PN US7074914-B1.
XX
PD 11-JUL-2006.
XX
PF 30-DEC-2004; 2004US-00028099.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
PR 26-MAY-2000; 2000US-00583110.
PR 14-AUG-2003; 2003US-00640833.
XX
PA (SNFI) SANOFI PASTEUR LTEE.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2006-500481/51.
DR P-PSDB; AEJ71170.
XX
PT New isolated nucleic acid and polypeptide from Streptococcus pneumoniae,
PT useful for diagnosing, preventing, or treating pathological conditions
PT resulting from bacterial infections, e.g. S. pneumoniae infection.
XX
PS Example; SEQ ID NO 1096; 29pp; English.
XX
CC The invention relates to an isolated nucleic acid, especially (AEJ68056),
CC which encodes the Streptococcus pneumoniae protein of AEJ70717. This
CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids
CC (AEJ67414-AEJ70074) isolated from a Streptococcus pneumoniae strain 14453
CC genomic library whose predicted products (AEJ70075-AEJ72735) exhibit
CC homology to known prokaryotic, eukaryotic or archaeal open reading frames
CC (ORFs) or proteins. The invention also relates to a recombinant
CC expression vector comprising the nucleic acid of the invention operably
CC linked to a transcription regulatory element; and a host cell comprising
CC the recombinant expression vector. The Streptococcus pneumoniae nucleic
CC acids and proteins of the invention are useful for diagnosing,
CC preventing, or treating pathological conditions resulting from bacterial
CC infections, especially infections caused by Streptococcus pneumoniae such
CC as meningitis, bacteremia, pneumonia and otitis media. They may also be
CC used in vaccine compositions for the treatment of Streptococcus
CC pneumoniae infections and as targets for antibacterial drugs.
CC Additionally the nucleic acids are useful in the production of
CC commercially important proteins such as enzymes for use in fermentation
CC reactions or in the production of commercially useful metabolites. The
CC present sequence represents a Streptococcus pneumoniae strain 14453
CC protein-encoding nucleic acid which was identified in the exemplification
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from the US patent office at
CC seqdata.uspto.gov/sequence.html?DocID=7074914B1.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;

Cy 31 CCATTTC^G^CCAATGACCAAG^GAAAG^TTT»^TAGOCAAG^Q0GAAQO^Q^0G^TACG 90 

Db 328 
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91 G^TQ3ACG^"G^Q3TTCTCX?^GMTAQ3MTAT(^GACTQG^"CTQG^"CCG^CX?^CGAQCQC 150 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

88 G^AQ3GAAAGAACG^CTTQCGG^AQGGAATCTTCX3^"TTQ:^^TG^"TTCCATTQGCAAAC^ 447 



CV 151 TTTATGAACCQCQQ3^ATG^QGQG^TG^TTTG^TTCAG^TCQG^TQCATTQXTTQCTC 210 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 448 TATG^CQG^"CG^"QG^ATQGAG^"TCCTTG^CTTG^TTCAAGAAQG^AATATQQXTTGATG 507 

Cy 211 AAQXCG^TG^GAAG^TCG^TCTTTCG^"ACG^TG^G^G^TTTTCGACCTATQGQG^QCCA 270 

I I I I I I I I I I I I I I I I I I III I I II I I I I I I I I I I I I 

Db 508 AAQXQ3^TG^GAAG^TTG^CTATTCTAAAQ^G^TGAAG^TTTGAACTTATQGAACTTQG 567 

CV 271 ATG^TGAT0G3AGAAATTCAA0QJ^ITTTQ0Q0G^TGACX> - - GTACX^CT 327 

Db 568 TQy\TTCXHrCAciCTAT(^CT(DbrQG!rAWQGA^ 627 

CV 328 0GATCGrrAAAAGAAAGAQC!GAATAAQGrQC!QGC!GATGAAAQ3ATG^ATTGn-A(^QG^ 387 

Db 628 GrTCACATQ3rTGl4\ACTAT(J\ATMATT 687 

CV 388 TT0Q30CCn-QCmXA0GATCQGAG^AGrQQGAG^AQGAGn-Q3G^AT(^0a 447 

Db 688 W(iidLAAUT(4xi(DAOGAl3^A(iGAWl^ 747 

Qy 448 GT 449

Db 748 GT 749

RESULT 11 
AEJ75484

ID AEJ75484 standard; DNA; 915 BP.
XX
AC AEJ75484;
XX
DT 05-OCT-2006 (first entry)
XX
DE Streptococcus pneumoniae strain 14453 protein-encoding DNA, SEQ 1096.
XX
KW Vaccine; diagnosis; drug discovery; protein production;
KW bacterial infection; Streptococcus pneumoniae infection;
KW bacterial meningitis; bacterial pneumonia; bacteremia; otitis media;
KW antibacterial; neuroprotective; antiinflammatory; respiratory-gen.;
KW auditory; gene; ds.
XX
OS Streptococcus pneumoniae; strain 14453.
XX
FH Key Location/Qualifiers
FT CDS 1..915
FT /*tag= a
FT /product= "Streptococcus pneumoniae protein SEQ ID
FT NO 3757"
XX
PN US7081530-B1.
XX
PD 25-JUL-2006.
XX
PF 30-DEC-2004; 2004US-00028291.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
PR 26-MAY-2000; 2000US-00583110.
PR 14-AUG-2003; 2003US-00640833.
XX
PA (SNFI) SANOFI PASTEUR LTEE.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2006-518920/53.
DR P-PSDB; AEJ78145.
XX
PT New isolated Streptococcus pneumoniae nucleic acid, useful as a molecular
PT target for detecting, diagnosing, preventing, or treating a pathological
PT condition resulting from bacterial infection.
XX
PS Example; SEQ ID NO 1096; 29pp; English.
XX
CC The invention relates to an isolated nucleic acid, especially (AEJ75443),
CC which encodes the Streptococcus pneumoniae protein of AEJ78104. This
CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids
CC (AEJ74389-AEJ77049) isolated from a Streptococcus pneumoniae strain 14453
CC genomic library whose predicted products (AEJ77050-AEJ79710) exhibit
CC homology to known prokaryotic, eukaryotic or archaeal open reading frames
CC (ORFs) or proteins. The invention also relates to a recombinant
CC expression vector comprising the nucleic acid of the invention operably
CC linked to a transcription regulatory element; and a host cell comprising
CC the recombinant expression vector. The Streptococcus pneumoniae nucleic
CC acids and proteins of the invention are useful for diagnosing,
CC preventing, or treating pathological conditions resulting from bacterial
CC infections, especially infections caused by Streptococcus pneumoniae such
CC as meningitis, bacteremia, pneumonia and otitis media. They may also be
CC used in vaccine compositions for the treatment of Streptococcus
CC pneumoniae infections and as targets for antibacterial drugs.
CC Additionally the nucleic acids are useful in the production of
CC commercially important proteins such as enzymes for use in fermentation
CC reactions or in the production of commercially useful metabolites. The
CC present sequence represents a Streptococcus pneumoniae strain 14453
CC protein-encoding nucleic acid which was identified in the exemplification
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from the US patent office at
CC seqdata.uspto.gov/sequence.html?DocID=7081530B1.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;



0/ 31 OCATTTCTG^CCAATGACCAAG^GAAAG^TTTG^TAQOGAAG^QCGAAQC^Q^CGATACG 90 

II I I I I I I I I I I I II II II II III I II II II II III 

Db 328 CCTCTCTTG^CGAATGAAG^QG^GAAAG^GrTGXACTG3CTGrT»V\GCTG3rG^TATC 387 

0/ 91 G^TQCACGTG^GCTTCTCGTGAATAGaoATATGAGACTGGTCTGGTCCGrCGrCCAQCQC 150 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 388 G^AQCCAAACAACG^CTTGCQGA^GGCAATCTTCa^TTGGTTG^TTCCATTQCCAAACQC 447 

0/ 151 TTTATCAACCQCQQ3rATGftAQCQG^TG^TTTGrTTGAG^TCQGn"TGCATTQ33TTQCTC 210 

Db 448 TAT(^lx3Gn-lirlirATQGAGnTCCnT(icTT 507 

0/ 211 AAGGCCG^TG^CAAG^TCG^TC^TTOG^"ACG^TG^G^G^TTTTOGAOCTATQCQG^QOCA 270 

Db 508 kklslixBli-TliUvvli-WlicTATT!^ 567 

0/ 271 ATG^TGATCQ3AGAAATTCAA0QC^TTTQ0Q0G^TGA0G■ - - GTACQGrTAAQGrCAGT 327 

Db 568 TQG^TTOGn-QAcicTATCACTlirQjrAWQCAG^OGM 627 

cy 328 CGATCGrTAAAAGAAAGAGDGAATAAQGrQ0QQCGATGAAAQ3AT»\ATTGn"ACAAQGAA 387 

Db 628 GrTGAGATQ3rTliAAl]TATCAATAAATTl3GrT(D^ 687 

cy 388 TTOQ^OOG^■QCaXGAOGATCQCAG^AG^QQCAG^AQGAG^■Q^G^ATCAOQOCQG^a 447 

Db 688 Wciiciw^UTy^tAOCAlBftAGAGAWl^ 747 
Qy 448 GT 449

Db 748 GT 749 

RESULT 12
AEJ82844

ID AEJ82844 standard; DNA; 915 BP.
XX
AC AEJ82844;
XX
DT 19-OCT-2006 (first entry)
XX
DE Streptococcus pneumoniae strain 14453 protein-encoding DNA, SEQ:1096.
XX
KW Vaccine; diagnosis; drug discovery; protein production;
KW bacterial infection; Streptococcus pneumoniae infection;
KW bacterial meningitis; bacterial pneumonia; bacteremia; otitis media;
KW antibacterial; neuroprotective; antiinflammatory; respiratory-gen.;
KW auditory; gene; ds.
XX
OS Streptococcus pneumoniae; strain 14453.
XX
FH Key Location/Qualifiers
FT CDS 1..915
FT /*tag= a
FT /product= "Streptococcus pneumoniae protein SEQ ID
FT NO 3757"
XX
PN US7098023-B1.
XX
PD 29-AUG-2006.
XX
PF 30-DEC-2004; 2004US-00027878.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
PR 26-MAY-2000; 2000US-00583110.
PR 14-AUG-2003; 2003US-00640833.
XX
PA (SNFI) SANOFI PASTEUR LTD.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2006-584390/60.
DR P-PSDB; AEJ85505.
XX
PT New isolated Streptococcus pneumoniae nucleic acid and polypeptide,
PT useful as vaccines and as targets for diagnosing, preventing, or treating
PT pathological conditions resulting from S. pneumoniae bacterial infection.
XX
PS Example; SEQ ID NO 1096; 29pp; English.
XX
CC The invention relates to an isolated nucleic acid, especially (AEJ84330),
CC which encodes the Streptococcus pneumoniae protein of AEJ86991. This
CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids
CC (AEJ81749-AEJ84409) isolated from a Streptococcus pneumoniae strain 14453
CC genomic library whose predicted products (AEJ84410-AEJ87070) exhibit
CC homology to known prokaryotic, eukaryotic or archaeal open reading frames
CC (ORFs) or proteins. The invention also relates to a recombinant
CC expression vector comprising the nucleic acid of the invention operably
CC linked to a transcription regulatory element; and a host cell comprising
CC the recombinant expression vector. The Streptococcus pneumoniae nucleic
CC acids and proteins of the invention are useful for diagnosing,
CC preventing, or treating pathological conditions resulting from bacterial
CC infections, especially infections caused by Streptococcus pneumoniae such
CC as meningitis, bacteremia, pneumonia and otitis media. They may also be
CC used in vaccine compositions for the treatment of Streptococcus
CC pneumoniae infections and as targets for antibacterial drugs.
CC Additionally the nucleic acids are useful in the production of
CC commercially important proteins such as enzymes for use in fermentation
CC reactions or in the production of commercially useful metabolites. The
CC present sequence represents a Streptococcus pneumoniae strain 14453
CC protein-encoding nucleic acid which was identified in the exemplification
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from the US patent office at
CC seqdata.uspto.gov/sequence.html?DocID=7098023B1.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;



0/ 31 CCATTTCTG^CCAATGACCAAGTGAAAG^TTTG^TAQCCAAG^Q3CAAQC^Q^CG^TACG 90 

Db 328 CCTCTCTTGACGAAtI^ 387 

Cy 91 G^TQCACG^G^QCTTCTCG^GAATAQGAATATCAGACTQ3TCTQ3TCCG^CG^CCAQCQC 150 

I I I I I I I I I I I I Mill I II II I I I I I I I I I I 
Db 388 G^AGCCAAACAACG^CTTG3QG^AQ3GAATC^TCG^TTGa^TG^TTTCCATTGXAAACGC 447 

0/ 151 TTTATGAA00Q0QQarAT»\AG3QGMGMTTGmCAGM0GGrTGCATTG3CTTQCTC 210 

Db 448 TATCTCQGrliTQGTATQGAGrrTCcA 507 

0/ 211 AAQ^CCG^TG^CAAG^TCG^TCTTTCG^"Aa5^TG^G^G^TTTTCGACCTATQCQG^Q0GA 270 

I I I I I I I I I I I I I I I I I I III II II I I I I I I I I I I I I 
Db 508 AAQ^CQ3^TG^CAAG^TTG^CTATTCTAAAQ^G^TCAAG^TTTCAACTTATQCAAC^"TQG 567 

0/ 271 ATG^TGATCQ3AGAAATTGAACGCTTTTTQCQCG^TGACG- - - GTACQGTTAAQGTCAGn" 327 

III III II II I I I I I I I I I II 

Db 568 TQG^TTCG^■GAQ^CTATCACT0G^QGTATTQ3A»\CCAAQCTCGTACCATCCG^ATCCCA 627 

0/ 328 a3^TCGrTAAAAGAAAGAGCGAATAAQGrQCQQGGATCAAAQ3ATGV\TTGn'ACAAQGAA 387 

Db 628 GrTCACATQ3rTliw\brATCAATAAATTlx?rT(^ 687 

CV 388 TT0Q30C!Gn-QCmXAC!GATCQGAG^AGrQQGAG^AQGAGn-Q3GftATCAC!QC^ 447 

Db 688 TTQCBCBciAAUTlixitAOGAlB^A 747 

Qy 448 GT 449

Db 748 GT 749 



RESULT 13 
AEL05163 

ID AEL05163 standard; DNA; 915 BP.
XX
AC AEL05163;
XX
DT 30-NOV-2006 (first entry)
XX
DE Streptococcus pneumoniae strain 14453 protein-encoding DNA, SEQ:1096.
XX
KW Vaccine; diagnosis; drug discovery; protein production;
KW bacterial infection; Streptococcus pneumoniae infection;
KW bacterial meningitis; bacterial pneumonia; bacteremia; otitis media;
KW antibacterial; neuroprotective; antiinflammatory; respiratory-gen.;
KW auditory; gene; ds.
XX
OS Streptococcus pneumoniae; strain 14453.
XX
FH Key Location/Qualifiers
FT CDS 1..915
FT /*tag= a
FT /product= "Streptococcus pneumoniae protein SEQ ID
FT NO 3757"
XX
PN US7115731-B1.
XX
PD 03-OCT-2006.
XX
PF 30-DEC-2004; 2004US-00027399.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
PR 26-MAY-2000; 2000US-00583110.
PR 14-AUG-2003; 2003US-00640833.
XX
PA (SNFI) SANOFI PASTEUR LTD.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2006-744050/76.
DR P-PSDB; AEL07824.
XX
PT New nucleic acid encoding Streptococcus pneumoniae polypeptide, useful
PT for detecting, preventing, and treating pathological conditions resulting
PT from bacterial infection.
XX
PS Example; SEQ ID NO 1096; 29pp; English.
XX
CC The invention relates to an isolated nucleic acid, especially (AEL05123),
CC which encodes the Streptococcus pneumoniae protein of AEL07784. This
CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids
CC (AEL04068-AEL06728) isolated from a Streptococcus pneumoniae strain 14453
CC genomic library whose predicted products (AEL06729-AEL09389) exhibit
CC homology to known prokaryotic, eukaryotic or archaeal open reading frames
CC (ORFs) or proteins. The invention also relates to a recombinant
CC expression vector comprising the nucleic acid of the invention operably
CC linked to a transcription regulatory element; and a host cell comprising
CC the recombinant expression vector. The Streptococcus pneumoniae nucleic
CC acids and proteins of the invention are useful for diagnosing,
CC preventing, or treating pathological conditions resulting from bacterial
CC infections, especially infections caused by Streptococcus pneumoniae such
CC as meningitis, bacteremia, pneumonia and otitis media. They may also be
CC used in vaccine compositions for the treatment of Streptococcus
CC pneumoniae infections and as targets for antibacterial drugs.
CC Additionally the nucleic acids are useful in the production of
CC commercially important proteins such as enzymes for use in fermentation
CC reactions or in the production of commercially useful metabolites. The
CC present sequence represents a Streptococcus pneumoniae strain 14453
CC protein-encoding nucleic acid which was identified in the exemplification
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from the US patent office at
CC seqdata.uspto.gov/sequence.html?DocID=7115731B1.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;

Cy 31 CCATTTC^G^CCAATGACCAAG^GAAAG^TTTG^TAGCCAAG^QCCAAQO^Q^CG^TACG 90 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 328 CCTC^C^TG^aCAATGAAG^GG^GAAA»\GrTGXACTQ3C^G^TG^Q0^QG^G^TATC 387 

cy 91 G^TQCACG^G^QCTTCTCG^GAATAGCAATATGAGACTQG^C^QG^CCG^CG^CCAGCGC 150 

Db 388 GAAQCCAAACAACCTcWqCC^^ 447 
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1 5 1 TTTATCAA(XQCQQ3^ATG^AGGQG^TG^TTTG^^TGAG^TCQG^TQ[)ATTQ^CJ^TQCTC 2 1 0 
448 TATCT(:X3Gn'y3n'QGrATQCAGnTCX^ 507 

211 AAQ^3CX?^TG^CAAG^TCX^TC^^TC)G^■A(:x3^TG^G^G^TTTTCC^ 270 

508 kkliixii-TliUvvli-WliCTATTbr 567 

271 ATG^TCATCX^3^GAAATTCAACX3:^^ITTTQC!GOG^TGAC!G■ - - CTAOQGrTAAGGTGAGn" 327 

568 TQy\TTCX?rCAci3CTAT(^CT(DbrQG!rAWQ^ 627 

328 C!GATCGrrAAAAGAAACAQC)GAATAAQGrQC!Q3C!GATGAAAQ3ATG^ATTGn"A^ 387 

I I I I I I I I I I I I I II III II III 
628 GrTCACATG3rTGAAACTAT(^TAAATTQGrT0GrGAACAGCX33^TCT(XnTC^ 687 



CV 388 TT0Q^00a^QCXXXX:Aa3ATCQGAG^AGTGQGAG^AQGAGTQ^G^ATCACQCCQG^QG^A 447 

Db 688 TTaiQCW\G^TCCGAGAG^ 747 

Qy 448 GT 449

I I 

Db 748 GT 749 

RESULT 14 
AEL12413 

ID AEL12413 standard; DNA; 915 BP.
XX
AC AEL12413;
XX
DT 28-DEC-2006 (first entry)
XX
DE Streptococcus pneumoniae strain 14453 protein-encoding DNA, SEQ:1096.
XX
KW Vaccine; diagnosis; drug discovery; protein production;
KW bacterial infection; Streptococcus pneumoniae infection;
KW bacterial meningitis; bacterial pneumonia; bacteremia; otitis media;
KW antibacterial; neuroprotective; antiinflammatory; respiratory-gen.;
KW auditory; gene; ds.
XX
OS Streptococcus pneumoniae; strain 14453.
XX
FH Key Location/Qualifiers
FT CDS 1..915
FT /*tag= a
FT /product= "Streptococcus pneumoniae protein SEQ ID
FT NO 3757"
XX
PN US7122368-B1.
XX
PD 17-OCT-2006.
XX
PF 30-DEC-2004; 2004US-00027877.
XX
PR 02-JUL-1997; 97US-0051553P.
PR 12-MAY-1998; 98US-0085131P.
PR 30-JUN-1998; 98US-00107433.
PR 26-MAY-2000; 2000US-00583110.
PR 14-AUG-2003; 2003US-00640833.
XX
PA (SNFI) SANOFI PASTEUR LTD.
XX
PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;
XX
DR WPI; 2006-812612/82.
DR P-PSDB; AEL15074.
XX
PT New isolated nucleic acid and polypeptide isolated from strain 14453 of
PT Streptococcus pneumoniae, useful for diagnosing, preventing, or treating
PT bacterial infections, e.g. S. pneumoniae infection.
XX
PS Example; SEQ ID NO 1096; 29pp; English.
XX
CC The invention relates to an isolated nucleic acid, especially (AEL12919),
CC which encodes the Streptococcus pneumoniae protein of AEL15580. This
CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids
CC (AEL11318-AEL13978) isolated from a Streptococcus pneumoniae strain 14453
CC genomic library whose predicted products (AEL13979-AEL16639) exhibit
CC homology to known prokaryotic, eukaryotic or archaeal open reading frames
CC (ORFs) or proteins. The invention also relates to a recombinant
CC expression vector comprising the nucleic acid of the invention operably
CC linked to a transcription regulatory element; and a host cell comprising
CC the recombinant expression vector. The Streptococcus pneumoniae nucleic
CC acids and proteins of the invention are useful for diagnosing,
CC preventing, or treating pathological conditions resulting from bacterial
CC infections, especially infections caused by Streptococcus pneumoniae such
CC as meningitis, bacteremia, pneumonia and otitis media. They may also be
CC used in vaccine compositions for the treatment of Streptococcus
CC pneumoniae infections and as targets for antibacterial drugs.
CC Additionally the nucleic acids are useful in the production of
CC commercially important proteins such as enzymes for use in fermentation
CC reactions or in the production of commercially useful metabolites. The
CC present sequence represents a Streptococcus pneumoniae strain 14453
CC protein-encoding nucleic acid which was identified in the exemplification
CC of the invention. Note: The sequence data for this patent did not form
CC part of the printed specification, but was obtained in electronic format
CC directly from the US patent office at
CC seqdata.uspto.gov/sequence.html?DocID=7122368B1.
XX
SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;
Best Local Similarity 50.5%; Pred. No. 2.4e-13;
Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;

CCATTTCTG^CCAATGACCAAG^GAAAG^TTTG^TAQOCAAG^QXAAQC^GGCG^TACG 90 

G^TQCAGG^■G^QGTTCTCG^GAATAQCAATATCAGACTQG^■CTQG^■GCG^CG^CGAQOQC 1 50 

G^AQCxwvvcAAyrcrrcr^ 447 

TTTATCAAG0Q0Q93^ATG^AQCQG^TG^TTTG^■TCAG^TGQG^TQCATTQ^GTTQC^C 210 
TAT(i-lx3Gn-(DbrlirATQGAGnTCCTTlic!A 507 
AAQ^CGG^TG^CAAG^TCG^TCTTTGG^■ACG^TG^G^G^TTTTGGAGCTATQCQG^QOCA 270 

Mill lllllllllll II III I I II Mill II Mill 

AAQ3CGGrTG^GAAGrTTG^CTATTCTAAAQ3GrrGAAGrTTTCAACTTATGCAACrTQG 567 

ATG^TCATCQGAGAAATTCAACGCTTTTTGGQCG^TGACG- - - GTACQGrTAAQGrCAGT 327 

Ml III II M II I I II II I II 



TTCQ3CCG^QCCCCCACGATCQGAG^AG^GQCAG^AQGAG^Q^G^ATCACQCCQG^QG^A 447 

II II I I MM III II II II II II II II II II I 





31 


Db 


328 


cy 


91 


Db 


388 


cy 


151 


Db 


448 


cy 


211 


Db 


508 


cy 


271 


Db 


568 


cy 


328 


Db 


628 


cy 


388 


Db 


688 


cy 


448 


Db 


748 
RESULT 15
AEL50821 

ID AEL50821 standard; DNA; 915 BP. 

AC AEL50821; 

DT 28- DEC- 2006 (f i rst ent ry) 
XX 

DE Streptococcus pneurroni ae strain 14453 pr ot ei n- encodi ng DNA, SEQ 1096. 

KW Vaccine; diagnosis; drug discovery; protein production; 

KW bacterial infection; Streptococcus pneumoniae infection; 

KW bacterial n^ningitis; bacterial pneumonia; bacterema; otitis oBdi a; 

KW antibacterial; neuroprotective; ant i i nf I anrrat ory; r espi r at or y- gen. ; 

KW auditory; gene; ds. 

XX 

OS streptococcus pneumoniae; strain 14453. 
XX 

FH Key Locat i on/ dual i f i er s 

FT CDS 1. .915 

FT /*tag= a 

FT /product= "Streptococcus pneumoniae protein SEQ I D 

FT ISO 3757" 

PN US71 29340- B1. 

PD 31- OCT- 2006. 

PF 30- DEC- 2004; 2004US- 00028457. 
XX 

PR 02-JUL-1997; 97US-0051553P.

PR 12-MAY-1998; 98US-0085131P.

PR 30-JUN-1998; 98US-00107433.

PR 26-MAY-2000; 2000US-00583110.

PR 14-AUG-2003; 2003US-00640833.

PA (SNFI) SANOFI PASTEUR LTD.
XX

PI Doucette-Stamm L, Bush D, Zeng Q, Opperman T, Houseweart CE;

DR WPI; 2006-812716/82.

DR P-PSDB; AEL53482. 
XX 

PT New isolated nucleic acid and polypeptide isolated from Streptococcus

PT pneumoniae, useful as components of antibacterial vaccines, and for

PT diagnosing or treating S. pneumoniae and other Streptococcus infections.

PS Example; SEQ ID NO 1096; 29pp; English.
XX 

CC The invention relates to an isolated nucleic acid, especially (AEL52290),

CC which encodes the Streptococcus pneumoniae protein of AEL54951. This

CC nucleic acid is one of 2661 disclosed protein-encoding nucleic acids

CC (AEL49726-AEL52386) isolated from a Streptococcus pneumoniae strain 14453

CC genomic library whose predicted products (AEL52387-AEL55047) exhibit

CC homology to known prokaryotic, eukaryotic or archaeal open reading frames

CC (ORFs) or proteins. The invention also relates to a recombinant

CC expression vector comprising the nucleic acid of the invention operably

CC linked to a transcription regulatory element; and a host cell comprising

CC the recombinant expression vector. The Streptococcus pneumoniae nucleic

CC acids and proteins of the invention are useful for diagnosing,

CC preventing, or treating pathological conditions resulting from bacterial

CC infections, especially infections caused by Streptococcus pneumoniae such

CC as meningitis, bacteremia, pneumonia and otitis media. They may also be

CC used in vaccine compositions for the treatment of Streptococcus

CC pneumoniae infections and as targets for antibacterial drugs.

CC Additionally the nucleic acids are useful in the production of

CC commercially important proteins such as enzymes for use in fermentation
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CC reactions or in the production of commercially useful metabolites. The

CC present sequence represents a Streptococcus pneumoniae strain 14453

CC protein-encoding nucleic acid which was identified in the exemplification

CC of the invention. Note: The sequence data for this patent did not form

CC part of the printed specification, but was obtained in electronic format

CC directly from the US patent office at

CC seqdata.uspto.gov/sequence.html?DocID=7129340B1.

SQ Sequence 915 BP; 279 A; 175 C; 220 G; 241 T; 0 U; 0 Other;

Query Match 10.1%; Score 76.4; DB 21; Length 915;

Best Local Similarity 50.5%; Pred. No. 2.4e-13;

Matches 213; Conservative 0; Mismatches 206; Indels 3; Gaps 1;



Qy 31
Db 328
Qy 91
Db 388
Qy 151
Db 448
Qy 211
Db 508
Qy 271
Db 568
Qy 328
Db 628
Qy 388
Db 688
Qy 448
Db 748



1 1 1 1 1 1 

^CCAATG 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

G^AGCCAAACAACG^CTTG3Q»V\Q0GAATCTTCGTTTQGTTGTTTCCATTGCCAAACGC 447 



M I 1 1 1 I 1 1 III II II I 1 1 II II 



5AGAAATTCAAOG3TTTTTGOGOGMGAOG- - - GTAOGGnAAGGTCAGr 327 

III II I I II I I MM I II 



TTGQQGGAAG^T(4xi^UoGAG^A(^ 747 



Search completed: November 4, 2008, 17:14:39
Job time : 247 secs
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